Bayesian group sparse learning for music source separation
نویسندگان
چکیده
Nonnegative matrix factorization (NMF) is developed for parts-based representation of nonnegative signals with the sparseness constraint. The signals are adequately represented by a set of basis vectors and the corresponding weight parameters. NMF has been successfully applied for blind source separation and many other signal processing systems. Typically, controlling the degree of sparseness and characterizing the uncertainty of model parameters are two critical issues for model regularization using NMF. This paper presents the Bayesian group sparse learning for NMF and applies it for single-channel music source separation. This method reconstructs the rhythmic or repetitive signal from a common subspace spanned by the shared bases for the whole signal and simultaneously decodes the harmonic or residual signal from an individual subspace consisting of separate bases for different signal segments. A Laplacian scale mixture distribution is introduced for sparse coding given a sparseness control parameter. The relevance of basis vectors for reconstructing two groups of music signals is automatically determined. A Markov chain Monte Carlo procedure is presented to infer two sets of model parameters and hyperparameters through a sampling procedure based on the conditional posterior distributions. Experiments on separating single-channel audio signals into rhythmic and harmonic source signals show that the proposed method outperforms baseline NMF, Bayesian NMF, and other group-based NMF in terms of signal-to-interference ratio.
منابع مشابه
Bayesian Group Sparse Learning for Nonnegative Matrix Factorization
Nonnegative matrix factorization (NMF) is developed for parts-based representation of nonnegative data with the sparseness constraint. The degree of sparseness plays an important role for model regularization. This paper presents Bayesian group sparse learning for NMF and applies it for single-channel source separation. This method establishes the common bases and individual bases to characteri...
متن کاملBayesian Modelling of Music: Algorithmic Advances and Experimental Studies of Shift-Invariant Sparse Coding
In order to perform many signal processing tasks such as classification,pattern recognition and coding, it is helpful to specify a signal model interms of meaningful signal structures. In general, designing such a modelis complicated and for many signals it is not feasible to specify the ap-propriate structure. Adaptive models overcome this problem by learningstructures from...
متن کاملBayesian Modelling of Music: Algorithmic Advances and Experimental Studies of Shift-Invariant Sparse Coding
In order to perform many signal processing tasks such as classification,pattern recognition and coding, it is helpful to specify a signal model interms of meaningful signal structures. In general, designing such a modelis complicated and for many signals it is not feasible to specify the ap-propriate structure. Adaptive models overcome this problem by learningstructures from...
متن کاملNonparametric Bayesian sparse factor analysis for frequency domain blind source separation without permutation ambiguity
Blind source separation (BSS) and sound activity detection (SAD) from a sound source mixture with minimum prior information are two major requirements for computational auditory scene analysis that recognizes auditory events in many environments. In daily environments, BSS suffers from many problems such as reverberation, a permutation problem in frequency-domain processing, and uncertainty abo...
متن کاملMulti Snapshot Sparse Bayesian Learning for DOA Estimation
March 1, 2016 The directions of arrival (DOA) of plane waves are estimated from multi-snapshot sensor array data using Sparse Bayesian Learning (SBL). The prior source amplitudes is assumed independent zero-mean complex Gaussian distributed with hyperparameters the unknown variances (i.e. the source powers). For a complex Gaussian likelihood with hyperparameter the unknown noise variance, the c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- EURASIP J. Audio, Speech and Music Processing
دوره 2013 شماره
صفحات -
تاریخ انتشار 2013